Apparatus and method for presenting media content with telepresence

ABSTRACT

A system that incorporates teachings of the present disclosure may capture images using a camera system at a location associated with a user, transmit video content representative of the images over a network for presentation by another media processor at another location, receive at a media processor media content and second video content representative of second images that are associated with the second user, and present at a display device of the location the media content and the second video content in a telepresence configuration that simulates a presence of the other user at the location, where at least one of the media content and the video content is presented as three dimensional content. Other embodiments are disclosed.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is related to co-pending U.S. Application entitled“APPARATUS AND METHOD FOR PRESENTING THREE DIMENSIONAL OBJECTS WITHTELEPRESENCE” by Hines et al., Attorney Docket No.2011-0168_(—)7785-0851, the disclosure of which is hereby incorporatedby reference herein in its entirety.

FIELD OF THE DISCLOSURE

The present disclosure relates generally to media content communicationand more specifically to an apparatus and method for presenting mediacontent with telepresence.

BACKGROUND

Media consumption has become a multibillion dollar industry thatcontinues to grow rapidly. High resolution displays such as highdefinition televisions and high resolution computer monitors can nowpresent two-dimensional movies and games with three-dimensionalperspective with improved clarity. Collectively, improvements indisplay, audio, and communication technologies are causing rapid demandfor consumption of all types of media content. As consumption increases,the desire to share with others the consumed media content andinformation related to the media content also increases.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts an illustrative embodiment of a communication system thatprovides media services with telepresence;

FIG. 2 depicts an illustrative embodiment of a presentation device andmedia processor for presenting media content that can be used in thesystem of FIG. 1;

FIG. 3 depicts an illustrative embodiment of a viewing apparatus thatcan be used with the presentation device of FIG. 2;

FIG. 4 depicts an illustrative embodiment of a presentation device witha polarized display that can be used in the system of FIG. 1;

FIGS. 5 and 6 depict illustrative embodiments of communication systemsthat provide media services with telepresence;

FIG. 7 depicts an illustrative embodiment of a method operating inportions of the devices and systems of FIGS. 1-6; and

FIG. 8 is a diagrammatic representation of a machine in the form of acomputer system within which a set of instructions, when executed, maycause the machine to perform any one or more of the methods describedherein.

DETAILED DESCRIPTION

The present disclosure describes, among other things, illustrativeembodiments of methods and devices for providing media content to aplurality of users, where the plurality of users are provided atelepresence of each other by establishing a communication session thatsimulates a co-location of each of the users at each of the userlocations. In one or more embodiments, the media content and/or theimages of the users, or a portion thereof, can be presented as threedimensional (3D) content to enhance the telepresence. In one or moreembodiments, the 3D content can be generated by a remote server and/orcan be generated by each media processor of the users, such as throughuse of a depth map. In one or more embodiments, 3D cameras and/or aplurality of two dimensional cameras at each user location can beutilized for generating 3D content. The media content can be selected inconjunction with a social network application associated with each ofthe users while control over presentation of the media content canremain with one of the users. In one or more embodiments, images of auser can be rotated or otherwise repositioned during presentation inresponse to detecting speech of the user to further enhance thetelepresence by simulating the user facing another user to speak. Inanother embodiment, holographic content can be generated and/orpresented at the user locations. Other embodiments are alsocontemplated.

One embodiment of the present disclosure can entail a server thatincludes a memory and a controller coupled to the memory. The controllercan be adapted to obtain media content, receive first images that arecaptured by a first camera at a first location associated with a firstuser, and receive second images that are captured by a second camera ata second location associated with a second user. The controller can beadapted to provide the media content and second video contentrepresentative of the second images to a first processor forpresentation at a first display device utilizing a first telepresenceconfiguration that includes the second user. The first processor and thefirst display device are associated with the first user and located atthe first location. The controller can be adapted to provide the mediacontent and first video content representative of the first images to asecond processor for presentation at a second display device utilizing asecond telepresence configuration that includes the first user. Thesecond processor and the second display device are associated with thesecond user and located at the second location. The providing of thefirst and second video content can be associated with a communicationsession between the first and second users. At least one of the firstvideo content and the second video content can be adapted forpresentation as 3D content by the corresponding first and secondprocessors.

One embodiment of the present disclosure can entail a method thatincludes capturing first images using a first camera system at a firstlocation associated with a first user. The method can includetransmitting first video content representative of the first images overa network for presentation by a second media processor at a secondlocation associated with a second user. The method can include receivingat a first media processor of the first location, media content andsecond video content representative of second images that are associatedwith the second user. The method can include presenting at a firstdisplay device of the first location, the media content and the secondvideo content in a first telepresence configuration that simulates apresence of the second user at the first location. The media content andthe first video content can be adapted for presentation by the secondmedia processor in a second telepresence configuration that simulates apresence of the first user at the second location. At least one of themedia content, the first video content and the second video content canbe presented as 3D content.

One embodiment of the present disclosure can entail a non-transitorycomputer-readable storage medium that includes computer instructions.The instructions can enable obtaining media content at a server. Thecomputer instructions can enable receiving at the server, first videocontent of a first user at a first location. The instructions can enablereceiving at the server, second video content of a second user at asecond location. The instructions can enable receiving at the server,third video content of a third user at a third location. Theinstructions can enable transmitting from the server, the media contentand the second and third video content to a first media processor forpresentation at a first display device that simulates the second andthird users being present at the first location. The instructions canenable transmitting from the server, the media content and the first andthird video content to a second media processor for presentation at asecond display device that simulates the first and third users beingpresent at the second location. The instructions can enable transmittingfrom the server, the media content and the first and second videocontent to a third media processor for presentation at a third displaydevice that simulates the first and second users being present at thethird location. The instructions can enable adjusting an orientation ofimages of the first user being presented at the second and third displaydevices in response to speech of the first user being detected. Theadjusted orientation can be based on simulating the images of the firstuser speaking to the second and third users.

FIG. 1 depicts an illustrative embodiment of a first communicationsystem 100 for delivering media content, which can include 3D mediacontent. System 100 can present media services that generatetelepresence configurations for different users at different locations,which simulate each of the users being present at the other user'slocation. The simulation can be accomplished in a number of differentways, such as by capturing images and/or modifying images of the usersto simulate the users being at the location viewing media content, asopposed to contemporary video conferences that capture images of theuser at a straight forward perspective. Other simulation techniques canbe utilized in the media services, such as detecting speech of a targetuser and simulating the target user turning to speak.

The communication system 100 can represent an Internet ProtocolTelevision (IPTV) broadcast media system although other media broadcastsystems are contemplated by the present disclosures. The IPTV mediasystem can include a super head-end office (SHO) 110 with at least onesuper headend office server (SHS) 111 which receives media content fromsatellite and/or terrestrial communication systems. In the presentcontext, media content can represent audio content, moving image contentsuch as videos, still image content, or combinations thereof. The SHSserver 111 can forward packets associated with the media content tovideo head-end servers (VHS) 114 via a network of video head-end offices(VHO) 112 according to a common multicast communication protocol.

The VHS 114 can distribute multimedia broadcast programs via an accessnetwork 118 to commercial and/or residential buildings 102 housing agateway 104 (such as a residential or commercial gateway). The accessnetwork 118 can represent a group of digital subscriber line accessmultiplexers (DSLAMs) located in a central office or a service areainterface that provide broadband services over optical links or coppertwisted pairs 119 to buildings 102. The gateway 104 can use commoncommunication technology to distribute broadcast signals to mediaprocessors 106 such as computers, Set-Top Boxes (STBs) or gamingconsoles which in turn present broadcast channels to display devices 108such as television sets or holographic display devices, managed in someinstances by a media controller 107 (such as an infrared or RF remotecontrol, gaming controller, etc.).

The gateway 104, the media processors 106, and/or the display devices108 can utilize tethered interface technologies (such as coaxial, phoneline, or powerline wiring) or can operate over a common wireless accessprotocol such as Wireless Fidelity (WiFi). With these interfaces,unicast communications can be invoked between the media processors 106and subsystems of the IPTV media system for services such asvideo-on-demand (VoD), browsing an electronic programming guide (EPG),or other infrastructure services.

Some of the network elements of the IPTV media system can be coupled toone or more computing devices 130, where a portion of these computingdevices can operate as a web server for providing portal services overan Internet Service Provider (ISP) network 132 to media processors 106,wireline display devices 108 or wireless communication devices 116(e.g., cellular phone, laptop computer, etc.) by way of a wirelessaccess base station 117 operating according to common wireless accessprotocols such as WiFi, or cellular communication technologies (such asGSM, CDMA, UMTS, WiMAX, Software Defined Radio or SDR, and so on).

A satellite broadcast television system can be used in conjunction with,or in place of, the IPTV media system. In this embodiment, signalstransmitted by a satellite 115 carrying media content can be interceptedby a common satellite dish receiver 131 coupled to the building 102.Modulated signals intercepted by the satellite dish receiver 131 can betransferred to the media processors 106 for decoding and distributingbroadcast channels to the display devices 108. The media processors 106can be equipped with a broadband port to the IP network 132 to enableservices such as VoD and EPG described above.

In yet another embodiment, an analog or digital broadcast distributionsystem such as cable TV system 133 can be used in place of or inconjunction with the IPTV media system described above. In thisembodiment the cable TV system 133 can provide Internet, telephony, andinteractive media services.

It is contemplated that the present disclosure can apply to any presentor next generation over-the-air and/or landline media content servicessystem. In one embodiment, an IP Multimedia Subsystem (IMS) networkarchitecture can be utilized to facilitate the combined services ofcircuit-switched and packet-switched systems in delivering the mediacontent to one or more viewers.

System 100 can provide 3D content to the building 102 for presentationand/or can provide 2D content that can be rendered into 3D content byone or more client devices, such as the media processor 106 or the TV108. The 3D image content can be based upon various 3D imagingtechniques, including polarization, anaglyphics, active shuttering (suchas alternate frame sequencing), autostereoscopy, and so forth. Thepresent disclosure contemplates presentation of all or a portion of adisplay in 3D, including utilizing devices that do not require awearable viewing apparatus (e.g., does not require active shutteringglasses).

In one embodiment, system 100 can include one or more image capturingdevices 175 (e.g. a camera) that can capture 2D and/or 3D images of auser and/or other objects at the building 102. Other components can beutilized in combination with or in place of the camera 175, such as ascanner (e.g., a laser system that detects object circumference),distance detector, and so forth. In one embodiment, camera 175 can be agroup of cameras, such as two or more cameras for providing differentviewing angles and/or for providing a holographic image. In oneembodiment, the camera 175 can capture images in 2D which are processedinto 3D content, such as by media processor STB 106 and/or computingdevice 130. In one embodiment, depth maps can be utilized to generate 3Dcontent from 2D images. In another embodiment, the camera 175 can be astereoscopic camera that directly captures 3D images through use ofmultiple lenses. A collector 176 or other component can facilitate theprocessing and/or transmission of the captured images. The collector 176can be a stand-alone device, such as in communication with the mediaprocessor 106 and/or the gateway 104 (e.g., wirelessly and/or hardwired)or can be integrated with another device, such as the media processor106.

Computing device 130 can also include computer readable storage medium180 having computer instructions for establishing a telepresencecommunication session between client devices. The computing device 130can provide media content to a number of different users at differentlocations, such as a user at building 102, via the telepresencecommunication session. Computing device 130 can provide the mediacontent in a telepresence configuration that simulates each of the otherusers (not shown) being present at building 102. For instance, thetelepresence configuration can display the media content and furtherdisplay each of the other users to simulate them watching the mediacontent. In one embodiment, the particular telepresence configurationcan be adjusted by one or more of the users based on user preferences,such as retrieved from a user profile or determined from monitoredviewing behavior.

In one or more embodiments, the media content and/or the images of theusers, or a portion thereof, can be presented as 3D content to enhancethe telepresence. For example, the 3D content can be generated bycomputing device 130 and/or can be generated by media processor 106,such as through use of a depth map. System 100 can include othercomponents to enhance the telepresence experience. For instance,lighting and audio components can be utilized to facilitate capturingthe images and audio from a user. The lighting and/or audio componentscan be controlled by the media processor 106 and/or by the computingdevice 130. User preferences and/or monitored behavior can be utilizedin controlling the lighting and/or audio components.

In one embodiment, the users can be part of a social network and thecomputing device 130 can be in communication with a social networkapplication, such as for selecting the media content to be provided inthe telepresence configuration. In one embodiment, one of the mediaprocessors 106 can maintain control over presentation of the mediacontent in the telepresence configuration, such as pause, fast-forward,rewind, size, resolution, and so forth. In one embodiment, thetelepresence configuration, including providing the media content andthe video content of each of the users, can be performed without usingthe computing device 130 to generate the video content from capturedimages or to combine the media and video content. In one example, thetelepresence configuration can be generated by the media processors anddistributed through a peer-to-peer technique, where the media processorsshare the video content amongst themselves and obtain the media contentfrom one of the media processors or from another source, such as mediacontent being broadcast. In one embodiment, each of the media processors106 of the different users can be in a master-slave arrangement tocontrol presentation of the media content and facilitate generating thetelepresence configuration.

System 100 enables video and/or audio content of the users to beprovided to the other users in real-time to establish a communicationsession while simulating the co-location of the users and providingtelepresence with the media content.

FIG. 2 depicts an illustrative embodiment of a presentation device 202and the media processor 106 for presenting a telepresence configuration210 that can include video content 225 which is captured images of oneor more other users that are at different locations from where thepresentation device 202 is located. The telepresence configuration 210can also include the media content 250. The telepresence configuration210 can simulate the other users being present at the location of thepresentation device 202 through use of the video content 225. Thesimulation can be performed in a number of different ways, includingpresenting the other users in the images as if they were viewing themedia content. The simulation can be facilitated by the positioning ofthe camera 175 and/or by post-capture processing, such as adjusting thevideo content 225 so that the other users appear as being rotatedtowards the media content 250. Other simulation effects can be utilized.For example, the images in the video content 225 can be re-sized,including based on the particular size of the presentation device 202,to further simulate the other users being present at the location of thepresentation device 202. The media content 250 and/or video content 225of one or more users can be provided for presentation in thetelepresence configuration 210 in 3D.

One or both of the presentation device 202 and the media processor 106can include the camera 175 that captures images of the user that areprovided to the other users in their telepresence configuration 210. Thecamera 175 can capture 2D images and/or can capture 3D images. Thecamera 175 can be a group of cameras to capture multiple views,including views to construct a holographic image, such as of the userand/or of objects associated with the user. In one embodiment, thepresentation device 202 can be a holographic display device thatpresents all or a portion of the telepresence configuration 210 asholographic content. The holographic content can allow a viewer'sperspective on a depicted object to change as the viewer moves aroundthe hologram content, just as it would if the object were real.

In the present illustration, the presentation device 202 is depicted asa television set. It will be appreciated that the presentation device202 can represent a portable communication device such as a cellularphone, a PDA, a computer, or other computing device with the ability todisplay media content. The media processor 106 can be an STB, or someother computing device such as a cellular phone, computer, gamingconsole, or other device that can process and direct the presentationdevice 202 to present images associated with media content. It isfurther noted that the media processor 106 and the presentation device202 can be an integral unit. For example, a computer or cellular phonehaving computing and display resources collectively can represent thecombination of a presentation device 202 and media processor 106.

The media processor 106 can be adapted to communicate with accessoriessuch as the viewing apparatus 300 of FIG. 3 by way of a wired orwireless interface, such as through RF and/or light waves 206. Thecommunication can be one-way and/or two-way communication, such asproviding the viewing apparatus 300 with a transceiver 302. A wiredinterface can represent a tethered connection from the viewing apparatus300 to an interface of the media processor (e.g., USB or proprietaryinterface). A wireless interface can represent a radio frequency (RF)interface such as Bluetooth, WiFi, Zigbee or other wireless standard.The wireless interface can also represent an infrared communicationinterface. Any standard or proprietary wireless interface between themedia processor 106 and the viewing apparatus 300 is contemplated by thepresented disclosure.

The viewing apparatus 300 can represent an apparatus for viewingtwo-dimensional and/or 3D stereoscopic images which can be still ormoving images. The viewing apparatus 300 can be an active shutterviewing apparatus. In this embodiment, each lens has a liquid crystallayer which can be darkened or made to be transparent by the applicationof one or more bias voltages. Each lens 304, 306 can be independentlycontrolled. Accordingly, the darkening of the lenses can alternate, orcan be controlled to operate simultaneously.

Each viewing apparatus 300 can include various components associatedwith a communication device including a wireline and/or wirelesstransceiver 302 (herein transceiver 302), a user interface (UI), a powersupply, a location detector, and a controller 307 for managingoperations thereof. The transceiver 302 can support short-range orlong-range wireless access technologies such as infrared, Bluetooth,WiFi, Digital Enhanced Cordless Telecommunications (DECT), or cellularcommunication technologies, just to mention a few. Cellular technologiescan include, for example, CDMA-1X, UMTS/HSDPA, GSM/GPRS, TDMA/EDGE,EV/DO, WiMAX, SDR, and next generation cellular wireless communicationtechnologies as they arise. The transceiver 302 can also be adapted tosupport circuit-switched wireline access technologies (such as PSTN),packet-switched wireline access technologies (such as TCPIP, VoIP,etc.), and combinations thereof.

The UI can include a depressible or touch-sensitive keypad with anavigation mechanism such as a roller ball, joystick, mouse, ornavigation disk for manipulating operations of the communication device300. The keypad can be an integral part of a housing assembly of theapparatus 300 or an independent device operably coupled thereto by atethered wireline interface (such as a USB cable) or a wirelessinterface supporting for example Bluetooth. The keypad can represent anumeric dialing keypad commonly used by phones, and/or a Qwerty keypadwith alphanumeric keys. The UI can further include a display such asmonochrome or color LCD (Liquid Crystal Display), OLED (Organic LightEmitting Diode) or other suitable display technology for conveyingimages to an end user of the apparatus 300. In an embodiment where thedisplay is touch-sensitive, a portion or all of the keypad 308 can bepresented by way of the display.

The UI can also include an audio system 312 that utilizes common audiotechnology for conveying low volume audio (such as audio heard only inthe proximity of a human ear) and high volume audio for hands freeoperation. The audio system 312 can further include a microphone forreceiving audible signals of an end user. The audio system 312 can alsobe used for voice recognition applications. The UI can further includean image sensor such as a charged coupled device (CCD) camera forcapturing still or moving images.

The power supply can utilize common power management technologies suchas replaceable and rechargeable batteries, supply regulationtechnologies, and charging system technologies for supplying energy tothe components of the apparatus 300 to facilitate long-range orshort-range portable applications. The location detector can utilizecommon location technology such as a global positioning system (GPS)receiver for identifying a location of the communication device 300based on signals generated by a constellation of GPS satellites, therebyfacilitating common location services such as navigation.

The transceiver 302 can also determine a proximity to a cellular, WiFior Bluetooth access point by common power sensing techniques such asutilizing a received signal strength indicator (RSSI) and/or a signaltime of arrival (TOA) or time of flight (TOF). The controller 306 canutilize computing technologies such as a microprocessor, a digitalsignal processor (DSP), and/or a video processor with associated storagememory such a Flash, ROM, RAM, SRAM, DRAM or other storage technologies.

In one embodiment, the viewing apparatus 300 can utilize a receiverportion of the transceiver 302 in the form of an infrared.Alternatively, the viewing apparatus 300 can function as a two-waycommunication device, in which case a full infrared transceiver could beutilize to exchange signals between the media processor 106 and theviewing apparatus 300.

The viewing apparatus 300 can utilize a controller 307 to controloperations thereof, and a portable power supply (not shown). The viewingapparatus 300 can have portions of a UI. For example, the viewingapparatus 300 can have a multi-purpose button 312 which can function asa power on/off button and as a channel selection button. A power on/offfeature can be implemented by a long-duration depression of button 312which can toggle from an on state to an off state and vice-versa. Fastdepressions of button 312 can be used for channel navigation.Alternatively, two buttons can be added to the viewing apparatus 300 forup/down channel selection, which operate independent of the on/off powerbutton 312. In another embodiment, a thumbwheel can be used forscrolling between channels.

The viewing apparatus 300 can also include an audio system 313 with oneor more speakers in the extensions of the housing assembly such as shownby references 314, 316 to produce localized audio 318, 320 near a user'sears. Different portions of the housing assembly can be used to producemono, stereo, or surround sound effects. Ear cups (not shown) such asthose used in headphones can be used by the viewing apparatus 300 (as anaccessory or integral component) for a more direct and low-noise audiopresentation technique. The volume of sound presented by the speakers314, 316 can be controlled by a thumbwheel 310 (or up/down buttons—notshown).

It would be evident from the above descriptions that many embodiments ofthe viewing apparatus 300 are possible, all of which are contemplated bythe present disclosure. In one embodiment, the viewing apparatus 300 canbe utilized as part of the image capture process. For instance, thetransceiver 302 can function to transmit a locator and/or calibrationrequest that is wirelessly emitted for receipt by the camera(s) 175 oranother processing device, such as the media processor 106. The emittedsignal can be position information that is utilized to facilitatecapturing images of the target, including adjusting the positioning andfocus of the camera(s) 175 to capture the user and/or another object.

In one embodiment, the presentation device 202 can present holographiccontent that enables different perspectives of a user and/or object tobe viewed depending on the position of the viewer. The holographiccontent can be all or a portion of the telepresence configuration 210,such as only the media content 250 or only one or more of the videocontent 225. As an example, the presentation device 202 can utilizeactive shuttering where different perspectives of an image are presentedduring different time slots which can be synchronized with the viewingapparatus 300. The particular perspective of an image can be viewed viathe active shuttering of the viewing apparatus 300 based on the positionof the viewer, such as detected from the viewing apparatus. An exampleof this is described in U.S. application Ser. No. 12/839,943 filed onJul. 20, 2010, the disclosure of which is hereby incorporated byreference in its entirety. Other techniques and components arecontemplated for presenting holographic content at the presentationdevice 202, including with or without a viewing apparatus 300.

In one embodiment, the images of the user in video content 225 can bemodified, including change of clothing, environment and/or appearance.For example, the images of the other users can be presented but withoutthe viewing apparatus 300 being worn. For instance, other images of theother users, such as in user profiles, can be utilized to modify theimages to fill in pixels where the viewing apparatus 300 was removed. Inanother example, the modification of the images of the video content 225can be based on the media content, such as the images of the other usersbeing presented but wearing a cowboy hat where the media content is acowboy movie. The modifications to the video content 225 can be based ona number of different factors, such as user preferences, and can becontrolled by various entities, such as allowing a user to retaincontrol over any modifications to the presentation of their own imagesand/or allowing a user to control any modification to the presentationof other users.

FIG. 4 depicts an illustrative embodiment of a presentation device 402with a polarized display. A display can be polarized with polarizationfilter technology so that alternative horizontal pixel rows can be madeto have differing polarizations. For instance, odd horizontal pixels 402can be polarized for viewing with one polarization filter, while evenhorizontal pixels 404 can be polarized for viewing with an alternativepolarization filter. The viewing apparatus 300 previously described canbe adapted to have one lens polarized for odd pixel rows, while theother lens is polarized for viewing even pixel rows. With polarizedlenses, the viewing apparatus 300 can present a user a 3D stereoscopicimage. The telepresence configuration 210 of FIG. 2 can be presentedutilizing the presentation device 402.

System 400 illustrates use of multiple cameras 175 for capturing imagesof user 420 from different perspectives or views. The differentperspective images can then be utilized for generating a 3Drepresentation of the user 420. The particular number and positioning ofthe cameras 175 can vary. In one embodiment, one of the cameras 175 canbe a depth or distance camera that is utilized for generating a depthmap associated with the user 420 so that the depth map and imagescaptured by the other cameras can be used in constructing the 3Drepresentation of the user 420.

FIG. 5 depicts an illustrative embodiment of a communication system 500that can provide the telepresence configuration 210 to a plurality oflocations 102, 502 and 503. While three locations are illustrated insystem 500, the present disclosure contemplates two or more locationsbeing utilized. The telepresence configuration 210 for each of thelocations 102, 502 and 503 includes the media content 250 and includesvideo content 225 for the other users. For example, a user 520 atlocation 102 is provided with video content 225 that includes otherusers 525 at locations 502 and 503. The computing device 130 can beutilized to provide the telepresence configuration 210 to each of thelocations 102, 502, 503, such as through receiving captured images ofeach of the users 520 and 525 and distributing the video content 225 andthe media content 250 to each of the locations. As an example, each ofthe media processors 106 can then present the video content 225 and themedia content 250, such as in the side-by-side window arrangement shownin FIG. 2. In one embodiment, the captured images and the media content250 can be combined by the computing device 130 into single content thatis provided to the locations 102, 502 and 503, such as through amulticast, without the need for further arranging the media and videocontent. In one embodiment, separate or a combined stream of the mediacontent 250 and the video content(s) 225 can be provided to each mediaprocessor 106 for combining into the telepresence configuration 210.

In one embodiment, the media processor 106 can instruct the users 520and 525 to sit or otherwise position themselves where they will bewatching the telepresence configuration 210. A position of the user canthen be determined for adjusting the camera 175. A distance to theviewer can be determined, such as through use of time-of-flight, stereotriangulation, sheet of light triangulation, structured light,interferometry, coded aperture, and so forth. Other components can alsobe utilized to facilitate the process, including a depth cameraintegrated with camera 175 or provided as a stand-alone component.

FIG. 6 depicts an illustrative embodiment of another communicationsystem 600 that can present the telepresence configuration 210 atdisplay devices 108 of different users at different locations via atelepresence communication session. System 600 can be overlaid oroperably coupled with the devices and systems of FIGS. 1-5 to receivemedia content 250 and/or video content 225, which is presentable as 3Dcontent. System 600 can include computing device 130 for receiving 2Dmedia content from a media source 650 and for generating (or otherwiseobtaining) a depth map associated with the media content, such as basedon object segmentation. The computing device 130 can encode the mediacontent and depth map (such as into a single video stream in H.264format encapsulated in an MPEG-2 wrapper) and transmit the media contentand depth map to one or more media processors 106, such as throughbroadcast, multicast and/or unicast utilizing network 625. In oneembodiment, the computing device 130 can generate the depth map inreal-time or near real-time upon receipt of the 2D media content, suchas from a broadcast studio. The computing device 130 can also generate adepth map for video content that is captured by the cameras 175 in 2D.

System 600 includes media processors 106 which receive the video streamof the 2D media and video content and the corresponding depth maps. Themedia processors 106 can generate 3D content using the depth maps inreal time upon receipt of the video stream. The media processors 106 canalso detect the capability of display devices (such as through HDMI1.4a) and can adjust the media content accordingly. For instance, if adisplay device 108 can only present 2D content, then the media processor106 may discard the depth map and provide the 2D content to the displaydevice. Otherwise, the media processor 106 can perform the real-timegeneration of the 3D content using the depth map and provide the contentto the 3D capable display device 108. The conversion into 3D contentfrom the depth map(s) can be based upon various imaging techniques andthe 3D presentation in the telepresence configuration 210 can be basedupon various formats including polarization, anaglyphics, activeshuttering (such as alternate frame sequencing), autostereoscopy, and soforth.

In one embodiment, position information associated with one or moreviewers can be utilized to adjust 3D media content, such as adjusting aconvergence of the media content 250 and/or video content 225 based on adistance of the viewer(s) from the display device 108. Calibration canalso be performed using a number of components and/or techniques,including a distance camera to measure distances and/or image camera 175for capturing images of the viewers which can be used for interpolatingdistances.

System 600 has the flexibility to selectively provide 2D content and 3Dcontent to different locations. System 600 further has the flexibilityto selectively provide a combination of 2D and 3D content forpresentation in the telepresence configuration 210 (FIG. 2). Forexample, a user may desire to watch the media content 250 in 3D whileviewing the video content 225 in 2D. The selection of 2D or 3Dpresentation can be based on a number of factors, including devicecapability and type of content. The selection can be made by a number ofdifferent entities, including the users via the media processors 106and/or by the service provider via computing device 130. The selectionof 2D or 3D can also be made by one or more devices of system 600without user intervention based on a number of factors, such as devicecapability, network status, viewing history, and so forth.

FIG. 7 depicts an illustrative embodiment of a method 700 operating inportions of the devices and systems described herein and/or illustratedin FIGS. 1-6. Method 700 can begin with step 702 in which media contentis obtained, such as through transmission over a network from a mediasource. The media content can be various types from various sources. Forexample, the media content can be movies that are broadcast or accessedon demand. In one embodiment, the media content can be still images. Inone embodiment, the media content can be images of an object that can bemanipulated, such as presenting images of a car that can be rotated. Themedia content can be received as 2D content and converted to 3D contentand/or can be received as 3D content. The media content can be receivedby the computing device 130 (e.g., a centralized distribution process)and/or received by one or more of the media processors 106 (e.g., adistributed or master-slave process). It should be understood that thepresent disclosure contemplates the media processor 106 being varioustypes of devices, including personal computers, set top boxes, smartphones and so forth.

At step 704, video content can be received from a plurality of differentmedia receivers 106 at different locations. The video content can bereceived as part of a communication session established between mediaprocessors 106 of each of the different users. Each of the video contentcan be received as 2D content and converted to 3D content and/or can bereceived as 3D content. Each of the video content can be received by thecomputing device 130 (e.g., a centralized distribution process) and/orreceived by one or more of the media processors 106 (e.g., a distributedor master-slave process). The video content can be captured by one ormore cameras 175 at each location, where the cameras are 2D and/or 3Dcameras.

Other components can also be used to facilitate capturing the videocontent, including lighting components and/or audio components, whichcan be controlled locally and/or remotely (e.g., by the computing device130 or a master media processor 106).

At step 706, it can be determined if 3D content has been requested or isotherwise desired. For instance, a user profile associated with eachuser at each location can be accessed by the computing device 130 and/orone or more of the media processors 106 to determine if 3D content isdesired for the media content and/or video content. If 3D content isdesired then at step 708 the content can be processed accordingly. Forexample, if the content received is in 3D format then a determinationcan be made if the format is compatible with the media processors 106and adjusted accordingly. For instance, content can be adjusted to becompatible with a first media processor 106 and a copy of the contentcan be further adjusted to be compatible with a second media processor.If the content is in 2D format then the content can be converted to 3Dformat, such as through use of a depth map or using other techniques. Atstep 710, the media content and the video content can be presented ateach display device of each location in a telepresence configuration,such as configuration 210 of FIG. 2. The telepresence configuration cansimulate each of the users being co-located at each location. In oneembodiment, the telepresence configurations can be adjustable, such asby the user selecting the configuration. The adjustments to thetelepresence configuration can include positioning of the video content,size, resolution, and so forth.

In one embodiment at step 712, the computing device 130 and/or the mediaprocessor 106 can monitor to detect speech of a user at one of thelocations. If speech is detected from a target user, then at step 714the video content can be adjusted (e.g., by the computing device 130and/or the media processor 106) to further simulate the target userspeaking to the other users. This simulation can include depicting thetarget user or a portion thereof (e.g., the user's head) turning to facethe viewer of the display device to speak with them. In one embodiment,the telepresence configuration can provide images of the rear of theother user's head's as if they were watching the media content and thenpresent the face of the target user when the target user is speaking. Inone embodiment, images of a front of a user's head can be used togenerate video content depicting the back of the user's head, such asthrough determining shape, circumference, hair color and so forth.

Upon reviewing the aforementioned embodiments, it would be evident to anartisan with ordinary skill in the art that said embodiments can bemodified, reduced, or enhanced without departing from the scope andspirit of the claims described below. The embodiments described abovecan be adapted to operate with any device capable of performing in wholeor in part the steps described for method 700.

In one embodiment, the device(s) that perform the functions describedherein can be selected based on capability. For example, if all mediaprocessors 106 have the ability to generate 3D video content then adistributed process can be utilized that does not utilize the computingdevice 130 for generating 3D content. As another example, if only aportion of the media processors 106 have the ability to generate 3Dcontent then a master-slave arrangement can be established between themedia processors 106 without the need to utilize the computing device130 to generate the 3D content. As another example, if none of the mediaprocessors 106 have the ability to generate 3D content then thecomputing device 130 can be utilized for generating 3D content.Similarly, 2D images captured by a 2D camera can be transmitted to adevice capable of generating 3D video content, such as the computingdevice 130 and/or another media processor 106. In one embodiment, theselection of the device(s) can be based on other factors, includingprocessing resources, workload, type of content and so forth. Forexample, if only one media processor 106 has the capability to generate3D content then the computing device 130 may be utilized along with theselect media processor for generating the 3D content.

In one embodiment, the selection of the media content can be performedin conjunction with a negotiation process amongst at least a portion ofthe users that are intended to receive the telepresence configuration.For example, the users can vote on the media content to be presented. Inanother embodiment, priority can be provided to particular users for thenegotiating process, such as priority based on device capability. Asanother example, past voting history can be used as a factor in theselection of the media content, such as weighting votes more heavilywhen the user has been unsuccessful in voting to select media content inthe past.

In one embodiment, the selection of the media content can be based onfactors associated with one of the users. For example, the other usersmay desire to wish happy birthday to a target user. A telepresencesession can be established with the target users and the other users inwhich the media content is a particular singer singing a birthday songto the target user. The selection of the singer can be done based on apreference of the target user, including based on monitored consumptionhistory by the target user of songs.

In one embodiment, the providing of the telepresence configuration canbe done in conjunction with a social network application. For example,each of the users can be members of the social network and theestablishing of the communication session between the different userscan be initiated based on selections made from the social networkapplication.

In one embodiment, the presentation of the telepresence configuration bya media processor 106 can be done at multiple display devices. Forexample, in a system that has three display devices positioned adjacentto each other, the media processor 106 can provide a middle displaydevice with the media content for presentation while providing the enddisplay devices with each of the video content from the other users tosimulate the other users being co-located at the location of the mediaprocessor 106.

In one embodiment, a plurality of formats can be generated for the mediacontent 250. The different formats can be based on the capabilities ofthe media processors 106 and/or the display devices 108. For instance,holographic content may be generated for the media processor 106 if itis determined that the presentation device 202 at location 102 is aholographic display device or otherwise has the ability to presentholographic images, while 3D content based on active shuttering can begenerated for the media processor 106 of location 502 if it isdetermined that capabilities at that location warrant this format. Inone embodiment, the selection and generation of the format of the objectcontent can be based on capability determinations being made by thedevices of system 500, such as the computing device 130 querying thelocal devices for display capabilities and/or accessing user profiles orpast history information to make the determination. In one embodiment,each of the various formats can be generated without regard to devicecapabilities and a selection can then be made of the correspondingformat to be transmitted.

Other suitable modifications can be applied to the present disclosurewithout departing from the scope of the claims below. Accordingly, thereader is directed to the claims section for a fuller understanding ofthe breadth and scope of the present disclosure.

FIG. 8 depicts an exemplary diagrammatic representation of a machine inthe form of a computer system 800 within which a set of instructions,when executed, may cause the machine to perform any one or more of themethodologies discussed above. In some embodiments, the machine operatesas a standalone device. In some embodiments, the machine may beconnected (e.g., using a network) to other machines. In a networkeddeployment, the machine may operate in the capacity of a server or aclient user machine in server-client user network environment, or as apeer machine in a peer-to-peer (or distributed) network environment.

The machine may comprise a server computer, a client user computer, apersonal computer (PC), a tablet PC, a laptop computer, a desktopcomputer, a control system, a network router, switch or bridge, or anymachine capable of executing a set of instructions (sequential orotherwise) that specify actions to be taken by that machine. It will beunderstood that a device of the present disclosure includes broadly anyelectronic device that provides voice, video or data communication.Further, while a single machine is illustrated, the term “machine” shallalso be taken to include any collection of machines that individually orjointly execute a set (or multiple sets) of instructions to perform anyone or more of the methodologies discussed herein.

The computer system 800 may include a processor 802 (e.g., a centralprocessing unit (CPU), a graphics processing unit (GPU, or both), a mainmemory 804 and a static memory 806, which communicate with each othervia a bus 808. The computer system 800 may further include a videodisplay unit 810 (e.g., a liquid crystal display (LCD), a flat panel, asolid state display). The computer system 800 may include an inputdevice 812 (e.g., a keyboard), a cursor control device 814 (e.g., amouse), a disk drive unit 816, a signal generation device 818 (e.g., aspeaker or remote control) and a network interface device 820. Thedevices of computer system 800 can be found in the previously shownfigures, such as camera 175, media processor 106, TV 202 and so forth.

The disk drive unit 816 may include a machine-readable medium 822 onwhich is stored one or more sets of instructions (e.g., software 824)embodying any one or more of the methodologies or functions describedherein, including those methods illustrated above. The instructions 824may also reside, completely or at least partially, within the mainmemory 804, the static memory 806, and/or within the processor 802during execution thereof by the computer system 800. The main memory 804and the processor 802 also may constitute machine-readable media. Theinstructions 824 can include one or more of the steps described above,including calibration steps, such as determining or interpolating viewerdistance, determining convergence from viewer distance, and so forth.

Dedicated hardware implementations including, but not limited to,application specific integrated circuits, programmable logic arrays andother hardware devices can likewise be constructed to implement themethods described herein. Applications that may include the apparatusand systems of various embodiments broadly include a variety ofelectronic and computer systems. Some embodiments implement functions intwo or more specific interconnected hardware modules or devices withrelated control and data signals communicated between and through themodules, or as portions of an application-specific integrated circuit.Thus, the example system is applicable to software, firmware, andhardware implementations.

In accordance with various embodiments of the present disclosure, themethods described herein are intended for operation as software programsrunning on a computer processor. Furthermore, software implementationscan include, but not limited to, distributed processing orcomponent/object distributed processing, parallel processing, or virtualmachine processing can also be constructed to implement the methodsdescribed herein.

The present disclosure contemplates a machine readable medium containinginstructions 824, or that which receives and executes instructions 824from a propagated signal so that a device connected to a networkenvironment 826 can send or receive voice, video or data, and tocommunicate over the network 826 using the instructions 824. Theinstructions 824 may further be transmitted or received over a network826 via the network interface device 820.

While the machine-readable medium 822 is shown in an example embodimentto be a single medium, the term “machine-readable medium” should betaken to include a single medium or multiple media (e.g., a centralizedor distributed database, and/or associated caches and servers) thatstore the one or more sets of instructions. The term “machine-readablemedium” shall also be taken to include any medium that is capable ofstoring or encoding a set of instructions for execution by the machineand that cause the machine to perform any one or more of themethodologies of the present disclosure.

The term “machine-readable medium” shall accordingly be taken toinclude, but not be limited to: solid-state memories such as a memorycard or other package that houses one or more read-only (non-volatile)memories, random access memories, or other re-writable (volatile)memories; magneto-optical or optical medium such as a disk or tape.Accordingly, the disclosure is considered to include any one or more ofa machine-readable medium, as listed herein and including art-recognizedequivalents and successor media, in which the software implementationsherein are stored.

Although the present specification describes components and functionsimplemented in the embodiments with reference to particular standardsand protocols, the disclosure is not limited to such standards andprotocols. Each of the standards for Internet and other packet switchednetwork transmission (e.g., TCP/IP, UDP/IP, HTML, HTTP), as well as theexamples for calibration, distance determination, communicationprotocols, and so forth, represent examples of the state of the art.Such standards are periodically superseded by faster or more efficientequivalents having essentially the same functions. Accordingly,replacement standards and protocols having the same functions areconsidered equivalents.

The illustrations of embodiments described herein are intended toprovide a general understanding of the structure of various embodiments,and they are not intended to serve as a complete description of all theelements and features of apparatus and systems that might make use ofthe structures described herein. Many other embodiments will be apparentto those of skill in the art upon reviewing the above description. Otherembodiments may be utilized and derived therefrom, such that structuraland logical substitutions and changes may be made without departing fromthe scope of this disclosure. Figures are also merely representationaland may not be drawn to scale. Certain proportions thereof may beexaggerated, while others may be minimized Accordingly, thespecification and drawings are to be regarded in an illustrative ratherthan a restrictive sense.

Such embodiments of the inventive subject matter may be referred toherein, individually and/or collectively, by the term “invention” merelyfor convenience and without intending to voluntarily limit the scope ofthis application to any single invention or inventive concept if morethan one is in fact disclosed. Thus, although specific embodiments havebeen illustrated and described herein, it should be appreciated that anyarrangement calculated to achieve the same purpose may be substitutedfor the specific embodiments shown. This disclosure is intended to coverany and all adaptations or variations of various embodiments.Combinations of the above embodiments, and other embodiments notspecifically described herein, are contemplated by the presentdisclosure.

The Abstract of the Disclosure is provided with the understanding thatit will not be used to interpret or limit the scope or meaning of theclaims. In addition, in the foregoing Detailed Description, it can beseen that various features are grouped together in a single embodimentfor the purpose of streamlining the disclosure. This method ofdisclosure is not to be interpreted as reflecting an intention that theclaimed embodiments require more features than are expressly recited ineach claim. Rather, as the following claims reflect, inventive subjectmatter lies in less than all features of a single disclosed embodiment.Thus the following claims are hereby incorporated into the DetailedDescription, with each claim standing on its own as a separately claimedsubject matter.

1. A server comprising: a memory; and a controller coupled to the memoryand adapted to: obtain media content; receive first images that arecaptured by a first camera at a first location associated with a firstuser; receive second images that are captured by a second camera at asecond location associated with a second user; provide the media contentand second video content representative of the second images to a firstprocessor for presentation at a first display device utilizing a firsttelepresence configuration that includes the second user, wherein thefirst processor and the first display device are associated with thefirst user and located at the first location; and provide the mediacontent and first video content representative of the first images to asecond processor for presentation at a second display device utilizing asecond telepresence configuration that includes the first user, whereinthe second processor and the second display device are associated withthe second user and located at the second location, wherein theproviding of the first and second video content is associated with acommunication session between the first and second users, wherein atleast one of the first video content and the second video content isadapted for presentation as three dimensional content by thecorresponding first and second processors.
 2. The server of claim 1,wherein the first telepresence configuration comprises images of thesecond user in the second video content with an adjusted orientation tosimulate the images of the second user viewing the media content.
 3. Theserver of claim 2, wherein the first telepresence configurationcomprises the images of the second user in the second video content withthe adjusted orientation to simulate turning to face the first user inresponse to speech from the second user being detected.
 4. The server ofclaim 1, wherein the first images captured by the first camera are twodimensional images, and wherein the controller is adapted to generatethe three dimensional content for the first video content based on thetwo dimensional first images.
 5. The server of claim 1, wherein themedia content is adapted for presentation as three dimensional content.6. The server of claim 1, wherein the media content is obtained by thecontroller as two dimensional content, and wherein the controller isadapted to convert the media content into three dimensional content. 7.The server of claim 1, wherein the media content is obtained by thecontroller as two dimensional content, wherein the first and secondimages are two dimensional images, and wherein the controller is adaptedto obtain depth maps corresponding to each of the media content, thefirst images and the second images for converting the media content, thefirst images and the second images into three dimensional content. 8.The server of claim 7, wherein the first processor is included in afirst set top box, and wherein the controller is adapted to: transmitthe depth maps corresponding to the media content and the second videocontent to the first processor for generating the three dimensionalcontent of the media content and the second video content.
 9. The serverof claim 1, wherein the controller is adapted to: access a user profileassociated with the first user; and determine whether the first videocontent is to be presented as two dimensional or three dimensionalcontent based on a preference of the user profile.
 10. The server ofclaim 1, wherein the controller is adapted to provide one of the firstand second processors with control over presentation of the mediacontent of the first and second telepresence configurations.
 11. Amethod comprising: obtaining first images that are captured by a firstcamera system at a first location associated with a first user;transmitting first video content representative of the first images overa network for presentation by a second media processor at a secondlocation associated with a second user; receiving at a first mediaprocessor of the first location, media content and second video contentrepresentative of second images that are associated with the seconduser; and presenting at a first display device of the first location,the media content and the second video content in a first telepresenceconfiguration that simulates a presence of the second user at the firstlocation, wherein the media content and the first video content isadapted for presentation by the second media processor in a secondtelepresence configuration that simulates a presence of the first userat the second location, and wherein at least one of the media content,the first video content and the second video content is presented asthree dimensional content.
 12. The method of claim 11, comprising:capturing the first images as two dimensional images; and transmittingthe first images to a remote server for conversion into threedimensional content.
 13. The method of claim 11, wherein the mediacontent is selected via a social network application.
 14. The method ofclaim 11, comprising: receiving at the first media processor, a depthmap associated with at least one of the media content and the secondvideo content; and generating at the first media processor, threedimensional content based on the depth map.
 15. The method of claim 11,wherein the three dimensional content is holographic content providingdifferent viewing perspectives based on viewer position.
 16. The methodof claim 15, comprising: obtaining a plurality of first images utilizinga plurality of cameras of the first camera system, wherein each of theplurality of first images are captured from different angles, andwherein the plurality of first images are used to generate theholographic content by at least one of the first media processor, thesecond media processor and a remote server.
 17. The method of claim 11,comprising presenting at the first display device the media content asthree dimensional content.
 18. A non-transitory computer-readablestorage medium comprising computer instructions to: obtain media contentat a server; receive at the server, first video content of a first userat a first location; receive at the server, second video content of asecond user at a second location; receive at the server, third videocontent of a third user at a third location; transmit from the server,the media content and the second and third video content to a firstmedia processor for presentation at a first display device thatsimulates the second and third users being present at the firstlocation; transmit from the server, the media content and the first andthird video content to a second media processor for presentation at asecond display device that simulates the first and third users beingpresent at the second location; transmit from the server, the mediacontent and the first and second video content to a third mediaprocessor for presentation at a third display device that simulates thefirst and second users being present at the third location; and adjustan orientation of images of the first user being presented at the secondand third display devices in response to speech of the first user beingdetected, wherein the adjusted orientation is based on simulating theimages of the first user speaking to the second and third users.
 19. Thenon-transitory computer-readable storage medium of claim 18 comprisingcomputer instructions to select the media content based on at least oneof signals received from a social network application associated withthe first, second and third users and a negotiation process executed bythe first, second and third media processors.
 20. The non-transitorycomputer-readable storage medium of claim 18, wherein at least one ofthe first video content, the second video content and the third videocontent is adapted for presentation as three dimensional content.